Clustering Sequences in a Metric Space The MoBIoS Project

نویسندگان

  • Rui Mao
  • Daniel P. Miranker
  • Jacob Neal Sarvela
  • Weijia Xu
چکیده

We are developing a [Molecular] Biological Information System (MoBIoS) based on metric space indices. Unfortunately, common similarity measures for sequence alignment do not form a metric-distance function. This is particularly vexing since the usual definition of edit distance does form a metric. Most clearly, the use of PAM log-odds matrices [2] yields higher similarity scores for more closely related sequences, an intuitively appealing result that reverses metric order. Further, log-odds scoring matrices contain negative values that can yield negative global alignment scores. This violates positivity. Use of PAM matrices also can violate symmetry and the triangle inequality.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MoBIoS: A Metric-Space DBMS to Support Biological Discovery

MoBIoS is a specialized database management system whose storage manager is based on metric-space indexing, and whose query language entails biological data types. When relational database management systems are used to support biological data, important data types are relegated to blob and unstructured text fields. Consequently, even simple, but critical queries are executed by sequentially du...

متن کامل

Using MoBIoS’ Scalable Genome Joins to Find Conserved Primer Pair Candidates Between Two Genomes

For the purpose of identifying evolutionary reticulation events in flowering plants, we determine a large number of paired, conserved DNA oligomers that may be used as primers to amplify orthologous DNA regions using the polymerase-chain reaction (PCR). We develop an initial candidate set by comparing the Arabidopsis and rice genomes using MoBIoS (Molecular Biological Information System). MoBIo...

متن کامل

Fixed Point Results on $b$-Metric Space via Picard Sequences and $b$-Simulation Functions

In a recent paper, Khojasteh emph{et al.} [F. Khojasteh, S. Shukla, S. Radenovi'c, A new approach to the study of fixed point theorems via simulation functions, Filomat, 29 (2015), 1189-–1194] presented a new class of simulation functions, say $mathcal{Z}$-contractions, with unifying power over known contractive conditions in the literature. Following this line of research, we extend and ...

متن کامل

Using MoBIoS' scalable genome join to find conserved primer pair candidates between two genomes

MOTIVATION For the purpose of identifying evolutionary reticulation events in flowering plants, we determine a large number of paired, conserved DNA oligomers that may be used as primers to amplify orthologous DNA regions using the polymerase chain reaction (PCR). RESULTS We develop an initial candidate set by comparing the Arabidopsis and rice genomes using MoBIoS (Molecular Biological Infor...

متن کامل

A note on convergence in fuzzy metric spaces

The sequential $p$-convergence in a fuzzy metric space, in the sense of George and Veeramani, was introduced by D. Mihet as a weaker concept than convergence. Here we introduce a stronger concept called $s$-convergence, and we characterize those fuzzy metric spaces in which convergent sequences are $s$-convergent. In such a case $M$ is called an $s$-fuzzy metric. If $(N_M,ast)$ is a fuzzy metri...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002